Evidence of Gas Phase Glucosyl Transfer and Glycation in the CID/HCD-Spectra of S-Glucosylated Peptides

Protein cysteine S-glycosylation is a relatively rare and less well characterized post-translational modification (PTM). Creating reliable model proteins that carry this modification is challenging. The lack of available models or natural S-glycosylated proteins significantly hampers the development of mass-spectrometry-based (MS-based) methodologies for detecting protein cysteine S-glycosylation in real-world proteomic studies. There is also limited MS-sequencing data describing it as easier to create synthetic S-glycopeptides. Here, we present the results of an in-depth manual analysis of automatically annotated CID/HCD spectra for model S-glucopeptides. The CID spectra show a long series of y/b-fragment ions with retained S-glucosylation, regardless of the dominant m/z signals corresponding to neutral loss of 1,2-anhydroglucose from the precursor ions. In addition, the spectra show signals manifesting glucosyl transfer from the cysteine position onto lysine, arginine (Lys, Arg) side chains, and a peptide N-terminus. Other spectral evidence indicates that the N-glucosylated initial products of transfer are converted into N-fructosylated (i.e., glycated) structures due to Amadori rearrangement. We discuss the peculiar transfer of the glucose oxocarbenium ion (Glc+) to positively charged guanidinium residue (ArgH+) and propose a mechanism for the gas-phase Amadori rearrangement involving a 1,2-hydride ion shift.


Introduction
Cellular glycosylation of proteins is an enzyme-directed process that leads to diverse protein-glycan structures, which are responsible for driving particular biological functions [1].Most glycans are assembled from a limited set of reducing monosaccharides linked by O, O-acetal bridges, which are themselves attached through O-or N-glycosidic bonds to side chains of serine/threonine (Ser/Thr) or asparagine (Asn) residues at specific locations on the polypeptide chains [2].These major types of glycosidic bonds have distinctive properties, which can be analyzed using well-established MS-based technologies [3].
The chemical synthesis of glycans and glycoconjugates has enabled the development of modern glycoproteomics by providing reference samples for systematic biological studies [4].In general, the synthesis of complex glycan intermediates relies on the transfer of the glycosyl residue from a selected, activated donor to a saccharide serving as an acceptor molecule.In classic chemical strategies, the acceptor has a target hydroxyl group accessible for the formation of a glycosidic linkage, whereas the reducing end of the growing glycan chain is orthogonally protected to permit the eventual assembly of final protein glycoconjugates [5].Another original approach achieves control over the regio-and stereochemistry of glycan synthesis by intramolecular glycosyl transfer.However, this strategy is not frequently used due to the great difficulty of acquiring sophisticated molecules concurrently exhibiting glycosyl donor and glycosyl acceptor functionalities [6].
Thioglycosides are favorable glycosyl donors due to their efficient activation by various thiophilic promoter systems [7].Chemical glycosylation is accomplished in extra-dry organic solvents to protect the activated donors from hydrolysis.In addition to the desired glycosylation product, the reaction typically generates a byproduct with no nucleophilic activity-the thioaglycon.The thioaglycon often remains tightly bound to the promoter used for activation and should be removed from the reaction mixture.
In contrast to extensively studied, structurally complex N-and O-glycosylated proteins, S-glycosylation involving protein cysteines is considered a rare post-translational modification (PTM).The S-linked glycans identified to date are mono-saccharides or short oligo-saccharides made of Glc, Gal, and GlcNAc residues.The distinctive mono-ADP-Sribosyl unit attached to proteins can also be categorized as S-glycans [8].
The experimental results of MS/MS-sequencing of glycopeptides show that S-glycosidic bonds in CID-type MS fragmentation are more stable than O-glycosidic linkages.This feature can help to determine the location of S-glycosylated cysteines in many proteins [9][10][11][12].For example, by employing a routine proteomic workflow with final LC-ESI tandem mass spectrometry, researchers have detected S-glycosylated catalytic cysteines transiently occurring in glycosyl-enzyme intermediates.Based on these findings, a retaining mechanism for catalysis of two mutant glycosyltransferases has been proposed [13].Nonetheless, detecting low-level S-linked glycosylation among ubiquitous O-and N-glycosylated proteins remains challenging and requires new tailored analytical methods [14].
Our previous work described a chemically modified lysozyme (P-00698) randomly S-glucosylated at cysteine positions available in the protein molecule [14].The locations of the S-modified sites were detected directly using routine mass spectrometric methods and indirectly after specific S-tagging of the S-glucosylated positions.As a result, a large library of CID/HCD fragmentation spectra was created for the model S-glucosylated and S-tagged peptides.Here, we present the results of a manual analysis of complex MS/MS spectra of selected S-glucosylated peptides from the previously created library.In the CID spectra, signals showing 162 Da neutral loss from the precursor ions dominate, accompanied by a series of y/b-fragment ions retaining intact S-glucosyl moieties.In addition, numerous deciphered signals provide spectral evidence of glucosyl transfer from cysteine onto Lys/Arg side chains and the N-terminals of the fragmenting peptides.Progressing the S→N glucosyl transfer leads to the formation of N-glucosylated derivatives and N-fructosylated derivatives known as Amadori products [15].We propose hypothetical mechanisms for these gas-phase transformations.

Results
The present study builds on our previous work [14], in which CID/HCD fragmentation spectra of S-glucopeptides were identified using the Byonic sequence search engine.The created CID/HCD spectral library contained 90 unique sequences of S-glucopeptides.In the present study, a set of spectra from the CID/HCD spectral library was selected for meticulous manual analysis.The selected spectra included precursor ions of charge state z = +2 or z = +3, with one S-glucosylated cysteine position.The omitted spectra possessed long sequences bearing more modifications and a precursor ion charge state z > 3.
Initial manual analysis was focused on identifying m/z signals corresponding to glucosyl oxocarbenium ion (Glc+, 163 Da) transfer from the Cys residue onto side chains of other amino acid residues.
The automatically annotated long series of b/y fragment ions with retained C[S-Glc]modification were typical for most of the CID spectra and proved the moderate stability of the modification.Manual searching revealed the presence of b/y fragment ions with no Cys in their sequences but bearing a mass gain of 162 Da.This was linked to Glc+ ion transfer from cysteine to other amino acid residues within fragmenting peptides.In particular, [b+162]+/++ fragment ions corresponding to short N-terminal sequences and [y+162]+/++ comprising Lys or Arg on C-termini were detected.Unexpectedly, fragment ions denoted as [y/b+162+162−losses]+/++ were also revealed.This implies the transfer of two Glc+ ions onto fragments with no Cys residue in their sequences.
In the HCD spectra, [M + zH + − 162]/z ions were weak or absent.A long series of y-type fragment ions from +2 and +3 charged precursors was observed, as presented in Section 2.3.2.

Literature-Based Characteristics of Peptidyl N-Glucosylated Arg and Lys MS-Fragmentation
Although not previously described in the literature, Glc+ ion migration onto Lys primary amine groups or peptide N-termini in a gas phase seemed plausible and worth further exploration.Yet, a similar glucosyl transfer onto protonated arginine residues raised questions about the mechanism of transformation.The assumed formation of Nglucosylated arginine with subsequent Amadori rearrangement would lead to the cognate isobaric product (i.e., 1-amino-1-deoxy-D-fructoside), which should be distinguishable in the CID spectra [16].Therefore, we conducted a thorough literature survey on the use of mass spectrometry in protein glycation studies.In particular, we focused on published data referring to the diagnostic fragmentation features of peptidyl N-glucosides, N-fructosides, and selected N-glycopeptides.The results are summarized in Table 1.Our review of the literature data enabled us to establish a general map of N-glucosylated peptidyl arginine CID/HCD fragmentation, which is presented in Figure 1.This map is applicable also to the CID fragmentation of N-glucosylated peptidyl lysine-i.e., if the pictured guanidinium group is replaced by the primary amino group.Then, 204 Da, 54 Da, and 24 Da diagnostic ions are not expected.The distinguished paths A, B, and C illustrate the main fragmentation routes, termed as follows:

•
Path C: peptidyl N-fructoside dehydration (Amadori product dehydration path) N-glucosides of primary amines are cyclic hemi-aminals (in the open form known as Schiff bases).Under physiological conditions, they hydrolyze easily or undergo Amadori rearrangement catalyzed by protic acids into isobaric N-fructosides (i.e., 1-amino-1-deoxy -D-fructosides) [16,17].To the best of our knowledge, the gas phase Amadori rearrangement of N-glucosylated peptides (under CID/HCD fragmentation conditions) has not been described previously.However, it is worth highlighting earlier studies that describe the CID fragmentation patterns of N-glucosylated asparagine in peptides [18] and discuss 0,2 A n cross-ring cleavage as a general diagnostic tool for glycan assignment in glycoconjugate mixtures [19].N-glucosides of primary amines are cyclic hemi-aminals (in the open form known as Schiff bases).Under physiological conditions, they hydrolyze easily or undergo Amadori rearrangement catalyzed by protic acids into isobaric N-fructosides (i.e., 1-amino-1-deoxy -D-fructosides) [16,17].To the best of our knowledge, the gas phase Amadori rearrangement of N-glucosylated peptides (under CID/HCD fragmentation conditions) has not been described previously.However, it is worth highlighting earlier studies that describe the CID fragmentation patterns of N-glucosylated asparagine in peptides [18] and discuss 0,2 A n cross-ring cleavage as a general diagnostic tool for glycan assignment in glycoconjugate mixtures [19].The detected signs of glucosyl (Glc+) transfer onto protonated arginine residue (ArgH+) led us to review the literature on the application of mass spectrometry in studies of protein arginine glycosylation [20].It is known that arginine residue in vivo can be N-glucosylated, N-GlcNAcetylated, N-rhamnosylated [21], and mono-ADP-N-ribosylated due to enzyme-catalyzed processes [22].These modifications are prone to glycan and carbodiimide (42 Da, CN 2 H 2 ) loss during CID.Therefore, it seemed that the 204 NL sig-nal (162 Da + carbodiimide = 204 Da) might be a useful diagnostic marker for detecting N-glucosylated arginine in the CID spectra of our S-glucosylated peptides.
Under physiological conditions, the primary amino groups within proteins may follow non-enzymatic condensation with aldehydes, typically reducing sugars such as glucose.This reaction proceeds through an acyclic imine (Schiff base), followed by Amadori rearrangement to form so-called glycation products (e.g., N-fructosylated derivatives) [15,23].Protein glycation can alter biomolecule natural functionalities and ultimately lead to abnormal biological processes.This important topic has been discussed in a large number of studies.In many of those studies, mass spectrometry was instrumental in detecting glycation structures [24][25][26][27][28].

Detailed Analysis of MS-Fragmentation Spectra for S-Glucopeptides
Fragmentation spectra of the following tryptic S-glucosylated peptides were subjected to detailed manual analysis: S1 and S2; Figures S5 and S6) S3; Figures S7 and S8) S5) These sequences have C-terminal lysine or arginine and single S-glucosylated cysteine located at the N-termini or within the peptide chain.As the exemplary sequence, C[+162.]KGTDVQAWIR was selected due to the accessibility of CID and HCD spectra for both +2 and +3 charged precursors.The results of the manual analysis are presented in Section 2.3, corresponding to the logic of Figure 1.The centrally positioned peptide sequence guides the reader over the data arranged in vertical and horizontal lines.The band y-ions were automatically annotated.Their total abundance was calculated separately for each ion type and considered as 100%, or the value of 1. Manually deciphered fragment ions arranged in columns represent entities of the same cleavage position but differing by mass loss or gain.Appropriately marked, diagnostic fragment ions derived from the decomposition of b-ions are placed in the upper rows.They indicate gas phase events on the precursor N-terminus.Changes in the precursor C-terminus are manifested by y-type diagnostic ions, analogically ordered in the lower rows.Their total abundance was expressed as the relative abundance versus the total abundance calculated for y ions.

CID-Fragmentation Analysis of C[+162.]KGTDVQAWIR Sequence
Fragment ions revealed in the CID spectrum of the +2 charged precursor (Figure 2) are presented in Tables 2 and 3.The long series of signals corresponding to this particular type of diagnostic ions provides evidence of molecular transformation processes ongoing in the gas phase, which are explained in an additional table (Table 4).
As can be seen in Table 3, the fragment ions b2 + 162, b3 + 162, and b8 + 162 carry an additional hexosyl residue, most likely attached either to the lysine residue or to the cysteine N-terminus, apparently due to an intermolecular glucosyl transfer.The diagnostic ions b1 + 144, b1 + 126, and b1 + 108 likely indicate an S→N glucosyl shift followed by Amadori rearrangement to N-fructosylated cysteine.
The general characteristics of the CID fragmentation patterns for the S-glucosylated peptides were presented in the Section 2.1.Diagnostic fragment ions from the CID spectrum of the +3 charged precursor are collected in Tables 5 and 6 and commented on in a table (Table 7).Diagnostic fragment ions from the CID spectrum of the +3 charged precursor are collected in Tables 5 and 6 and commented on in a table (Table 7).

HCD-Fragmentation Analysis of the C[+162.]KGTDVQAWIR Sequence
A few ions confirming glucosyl transfer and subsequent Amadori rearrangement populate the HCD spectra of the +2 and +3 charged precursors (Figures 4 and 5, respectively).The fragment ions containing a hydroxymethyl-imidazole ring [y+54]+/++ are slightly more abundant than the others.Manual analysis of selected peptides with N-terminal lysine and S-glucosylated cysteine located within the sequence confirmed all observations and conclusions presented in the previous sections.These results are presented in the Supplementary Materials (Tables S6-S8, Figures S11 and S12).

Discussion
The CID spectra of S-glucosylated peptides reveal the presence of automatically assigned principal fragment ions along with numerous unidentified m/z signals, including high-intensity peaks.Manual examination of the spectra enabled us to assign the fragment ions to many previously undefined m/z signals.Their presence sheds light on the side transformations concurrent with the MS/MS-sequencing process.

Discussion
The CID spectra of S-glucosylated peptides reveal the presence of automatically assigned principal fragment ions along with numerous unidentified m/z signals, including high-intensity peaks.Manual examination of the spectra enabled us to assign the fragment ions to many previously undefined m/z signals.Their presence sheds light on the side transformations concurrent with the MS/MS-sequencing process.

Discussion
The CID spectra of S-glucosylated peptides reveal the presence of automatically assigned principal fragment ions along with numerous unidentified m/z signals, including high-intensity peaks.Manual examination of the spectra enabled us to assign the fragment ions to many previously undefined m/z signals.Their presence sheds light on the side transformations concurrent with the MS/MS-sequencing process.
The mechanisms of these molecular transformations in a gas phase demand explication in the context of analogous phenomena reported in the literature for ion/ion and ion/molecule reactions [29,30].The postulated transfer of the glucose oxocarbenium ion Glc+ onto the guanidinium moiety of arginine (ArgH+) seems puzzling, for it would require difficult deprotonation of (ArgH+), enabling N-glucosylation of the uncharged arginine side chain.

Reported Spectral Evidence of Gas Phase Glycosyl Transfer
The following examples are drawn from mass spectrometric studies on unique glycans derived from various glycoconjugates.Currently, glycan rearrangement is observed via the migration of small monosaccharides to other intra-glycan positions, using CID analysis.In such cases, the glycan molecule plays a double function as the glycosyl donor and acceptor.Intra-molecular xylose migration has been observed using tandem mass spectrometry of N-linked glycans [31].In studies on the Lewis X (Lex) and blood group antigen H-2, the CID-induced migration of fucose assisted by mobile proton led to the formation of a new O-glycosidic linkage [32].Another intriguing outcome relates to CID-MS/MS analysis of a synthetic neo-glycolipid.As a result of an intramolecular mechanism of O-to-C glycosyl transfer, a C-glycosylated cholesterol derivative formed [33].

Glucosyl Transfer Evidence in CID/HCD Spectra of S-Glucopeptides
Tryptic peptides must cover a long analytical pathway, from protein digestion through chromatographic separation and complex ESI-MS/MS spectra acquisition, to become sequenced [34,35].In the dry gas phase, charged peptides are flexible and continuously shape their conformations via intramolecular hydrogen bonds, salt bridges, hydrophobic interactions, and collisional interactions [36,37].In aqueous conditions, the flexibility of peptides can be determined by measuring their end-to-end collision frequency [38].Peptides' conformational motions in a gas phase can be computed using the eBGF algorithm [39].
In the present study, we selected CID/HCD spectra of S-glucosylated peptides containing the C[S-Glc] modification as a glycosyl ion [Glc+] source and diverse reactive groups located in their side chains.We assumed that under CID conditions the S-glycosidic bonds undergo "trashless" activation due to multiple collisions of the peptide with neutral gas molecules, as well as self-collisional interactions.As a result, a reactive transient ion pair, [Glc+/anh-Glc+][: S-Cys][Peptide] z+ OH , composed of Glc-oxocarbenium ion/protonated 1,2-anhydro-glucose [40] and peptidyl thiolate, may appear.This contact ion pair would be held together by electrostatic attraction.In most cases, collisional activation leads to a neutral loss of 1,2-anhydro-glucose (162 Da) from the precursor ion.The 163 m/z signal of Glc+ may also appear in the MS/MS spectra.
The existing transient ion pair can collide with its conformationally accessible side chains, bearing nucleophilic/electrophilic functional groups-e.g., those located at the peptide C-or N-terminus.In anhydrous gaseous environments, the cysteine thiolate ion, [: S-Cys][Peptide] z+ OH , exhibits variable nucleophilicity and basicity, which depend on the peptide sequence and conformation.Therefore, we postulate that in favored spatial arrangements, self-collisional peptide interactions may result in glucosyl transfer from Cys to side chains of Lys and Arg, or N-terminal amine groups.Such a collisional state brings together the critical structural elements of the glycopeptide-i.e., the glucosyl donor group, acceptor side chains, and groups involved in proton transfer (-NH 2 , -SH, -COOH).Thus, in certain respects, such a collisional state resembles the catalytic center environment of some glycosyltransferases.
Generally, the stereochemistry of collisions and the energies of the colliding partners may control the mechanisms of glucosyl transfer to the Arg guanidinium end.Two routes, denoted Route N and Route B, are postulated.In Route N, the peptidyl thiolate acts as a catalytic nucleophile (N), whereas in following Route B, it behaves as a catalytic base (B). Figure 6 illustrates these two possible mechanisms of arginine N-glycosylation of peptides in a gas phase, with route B also applying for N-glycosylation of the protonated lysine and the peptide N-terminus.
Thus, in certain respects, such a collisional state resembles the catalytic center environment of some glycosyltransferases.
Generally, the stereochemistry of collisions and the energies of the colliding partners may control the mechanisms of glucosyl transfer to the Arg guanidinium end.Two routes, denoted Route N and Route B, are postulated.In Route N, the peptidyl thiolate acts as a catalytic nucleophile (N), whereas in following Route B, it behaves as a catalytic base (B). Figure 6 illustrates these two possible mechanisms of arginine N-glycosylation of peptides in a gas phase, with route B also applying for N-glycosylation of the protonated lysine and the peptide N-terminus.The guanidinium function of arginine is a planar resonance-stabilized structure comprising a central carbon atom of sp2 hybridization, which is linked to three nitrogen atoms by single C-N bonds [41].The approaching thiolate anion neutralizes the positive charge of guanidinium carbon and changes the orbital hybridization of both carbon and nitrogen atoms.The resulting uncharged transient thioether cyclopeptide has tetrahedral carbon connected to three sp3-hybridized amine groups.This sterically crowded and electronrich structural element traps the proximate Glc+ ion to form a C1-N+ glycosidic linkage with an accessible primary amine group.The positive charge of the created transitional structure is located on a single nitrogen atom and is not resonance-stabilized.Finally, the reaction of a thioether ring opening liberates the side chain's cysteine thiol and restores the energetically favorable planar guanidinium N-glucosylated function.
In summary, the proposed mechanism of glucosyl transfer appears feasible through the sp2 to sp3 hybridization changes in the carbon and nitrogen atoms of the arginine guanidinium function.Such a rehybridization can be described as "structural pyramidalization," which enhances the nucleophilicity of nitrogen atoms.
The concept of enzymatic electrophilic functionalization of the arginine guanidinium group, omitting its deprotonation, was developed in studies on arginine kinases by Falcioni et al. [42].These authors proposed a sophisticated mechanism of Arg The guanidinium function of arginine is a planar resonance-stabilized structure comprising a central carbon atom of sp2 hybridization, which is linked to three nitrogen atoms by single C-N bonds [41].The approaching thiolate anion neutralizes the positive charge of guanidinium carbon and changes the orbital hybridization of both carbon and nitrogen atoms.The resulting uncharged transient thioether cyclopeptide has tetrahedral carbon connected to three sp3-hybridized amine groups.This sterically crowded and electron-rich structural element traps the proximate Glc+ ion to form a C1-N+ glycosidic linkage with an accessible primary amine group.The positive charge of the created transitional structure is located on a single nitrogen atom and is not resonance-stabilized.Finally, the reaction of a thioether ring opening liberates the side chain's cysteine thiol and restores the energetically favorable planar guanidinium N-glucosylated function.
In summary, the proposed mechanism of glucosyl transfer appears feasible through the sp2 to sp3 hybridization changes in the carbon and nitrogen atoms of the arginine guanidinium function.Such a rehybridization can be described as "structural pyramidalization," which enhances the nucleophilicity of nitrogen atoms.
The concept of enzymatic electrophilic functionalization of the arginine guanidinium group, omitting its deprotonation, was developed in studies on arginine kinases by Falcioni et al. [42].These authors proposed a sophisticated mechanism of Arg phosphorylation, involving a process they described as "polarization-pyramidalization" of nitrogen in the arginine side chain.They also suggested that this phenomenon is exploited by many classes of enzymes mediating the post-translational modification of arginine, including N-glycosylation [42].
The presented series of transformations proceeding in a gas phase and resulting in arginine residue N-glucosylation appears analogous to the arginine deiminase mode of action.Arginine deiminase (EC 3.5.3.6)uses the catalytic Cys406 as an essential nucleophile to form an intermediate covalent adduct with the guanidinium carbon of the substrate, followed by ammonia elimination [43].Other less adequate examples exist of liquid phase reactions employing protonated guanidine derivatives and thiolates to perform particular target-directed transformations [44].It has also been reported that the arginine guanidinium ion reacts with a carbanion derived from the condensation of glutaraldehyde with the lysine ε-amine group [45].

2.
Route B: Cysteine thiolate behaves as a catalytic base In the alternative route B, the cysteine thiolate in the [Glc+/anh-Glc+][: S-Cys][Peptide] z+ OH ion pair deprotonates a conformationally accessible guanidinium moiety of protonated ArgH+ (Figure 6d).The resulting (unprotonated) guanidino group, while retaining its planar geometry, acquires nucleophilic properties.Next, it attacks the proximate Glc+ ion to form a charged N-glucosylated Arg+ (Figure 6c).In this variant of the arginine N-glycosylation mechanism, the basicity of the thiolate in a gas phase dictates its role as an initial proton acceptor triggering the subsequent transformation steps.Some experimental facts support this hypothetical sequence of rearrangements.In a liquid phase, the Arg of pKa 13.8 [46] can be N-functionalized using a stronger base-e.g., the Barton base of pKa 15.3 [47,48].Analogously, with NCS reagent in a gas phase, N-acylation of the peptidyl arginine residue was observed exclusively if it existed in an un-protonated form [29,49].It has also been reported that the pKa parameter, calculated for the cysteine residues present in some human kinases, ranges in scope from 7 to even 24 units [50,51].This indicates enormous differences in the reactivity of the cysteine thiols dictated by the protein cavity characteristics.Thus, in favorable surroundings, cysteine can be much more basic than arginine.Studies on the Rhodobacter sphaeroides mitochondrial proton pump provide evidence of the role of Cys-139 as an initial proton acceptor acting in the altered form of Cytochrome c oxidase (CcO) [52].The cited scientific data demonstrate the experimental conditions in which the [:S-Cys]Peptide thiolate behaves either as a strong nucleophile or as a super-basic proton acceptor.

Gas Phase Amadori Rearrangement
We demonstrated in the results section that Lys and Arg N-glucosylated gas phase transfer products experience Amadori rearrangement to N-fructosylated structures.To the best of our knowledge, such events have not been observed previously.The hypothetical mechanism of these transformations is depicted in Figure 7.The process is initiated by the collisional splitting of the C(5)O-C(1) bond in the glucose hemiaminal (Figure 7a), leading to the formation of the guanidinium Schiff base (Figure 7b).Hydride 1,2-migration from C(2) to C(1) results in a new cationic species (Figure 7c).Subsequent ring closure involving the C(6)-OH and C(2)+ cationic centers produces N-fructosylated guanidinium residue of Arg (Figure 7d).The postulated mechanism seems compatible with the low proton mobility environment of a gas phase, although 1,2-hydride shift mechanisms have also been proposed to explain enzymatic glucose to fructose interconversion [53].

Materials and Methods
The fragmentation spectra used in this study were derived from a library reported in a previous study.The list of CID and HCD spectral libraries containing S-glucosylated peptides is accessible in [14] (Table 1), and their sequences can be viewed in the Supplementary Materials at https://link.springer.com/article/10.1007/s00726-022-03208-7(on 1 July 2024).

MS/MS Spectra Acquisition
All CID and HCD fragmentation spectra were generated using an LTQ Orbitrap Velos (Thermo Fisher Scientific, Bergen, Norway) spectrometer working in the regime of data-dependent acquisition.The normalized collision energy was set to 30%.Dynamic exclusion was disabled.Raw instrument data (.RAW) were processed using the Byonic search engine (Protein Metrics Inc., Cupertino, CA, USA, v.0-25) against the Uni-Protac- The 1,2-hydride shift explains the fragmentation pattern of the S-adenosyl-L-methionine observed during CID analysis [54].Analogously, the same phenomenon is proposed for elucidating differences in CID fragmentation of L-leucine and L-isoleucine [55].

Materials and Methods
The fragmentation spectra used in this study were derived from a library reported in a previous study.The list of CID and HCD spectral libraries containing S-glucosylated peptides is accessible in [14] (Table 1), and their sequences can be viewed in the Supplementary Materials at https://link.springer.com/article/10.1007/s00726-022-03208-7(on 1 July 2024).

MS/MS Spectra Acquisition
All CID and HCD fragmentation spectra were generated using an LTQ Orbitrap Velos (Thermo Fisher Scientific, Bergen, Norway) spectrometer working in the regime of data-dependent acquisition.The normalized collision energy was set to 30%.Dynamic exclusion was disabled.Raw instrument data (.RAW) were processed using the Byonic search engine (Protein Metrics Inc., Cupertino, CA, USA, v.0-25) against the Uni-Protaccession_p00698.decoys.fasta.Protein FDR was set to 1% FDR.All searches used a 6-ppm precursor and 20-ppm fragment ion tolerance for HCD, with 0.500 Da tolerance for CID fragmentation.All searches considered tryptic peptides with a maximum of two missed cleavages.

Manual Curation of Diagnostic Fragment Ions
The fragmentation pathways shown in Figure 1 display ions and their lost or gained neutral fragments, the sizes of which were determined using a monoisotopic mass calculator accessible at https://www.sisweb.com/referenc/tools/exactmass.htm(accessed between 20 June 2022 to 31 January 2024).The results are presented in Table 8.Theoretical values of y/b fragment ions for a particular peptide sequence were determined using an MS/MS fragmentation calculator provided by the University of Washington's Proteomics Resource available online at https://proteomicsresource.washington.edu/cgi-bin/fragment.cgi(accessed between 20 June 2022 to 31 January 2024).The obtained data were used to calculate theoretical values for the target diagnostic fragment ions listed in Table 1, Section 2.2.Next, a manual search of selected MS/MS spectra was carried out to identify the m/z signals corresponding to particular diagnostic ions.The Excel sheets containing exemplary calculations and outcomes of manual spectra searches are also included in the Supplementary Materials (Supp Excel files).

Conclusions
The CID/HCD-fragmentation of S-glucopeptides is accompanied by a transfer of glucosyl ion Glc+ from cysteine to C-terminal Lys, Arg, and N-terminal amine groups.The moderate stability of the S-glycosidic bond under CID conditions enables the S-glucosylated precursor ions and their nascent fragment ions to play the double role of donor/acceptor in inter-and intramolecular glycosyl transfer processes.
We postulate that collisional activation of the S-glucosidic linkage may generate species carrying a reactive ion pair structure.The peptidyl thiolate component of the ion pair can display capabilities of both a super-active base and a strong nucleophile.
Such a duality can explain two alternative pathways of glycosyl transfer onto protonated arginine residue.While behaving as a nucleophile, the peptidyl thiolate attacks the guanidinium carbon, neutralizes its positive charge, and drives changes in orbital hybridization-a process we termed "pyramidalization."The resulting transient electronrich tetrahedral structure captures a nearby Glc+ cation and is stabilized by peptidyl thiol elimination.Ultimately, a charged planar N-glucosylated arginine residue is formed.
Glycosylation of cysteine and arginine in proteins is a new research area.Therefore, the results of our work might become informative, inspiring, or useful for a wider group of glycoproteomic communities.

Conflicts of Interest:
The author declares no conflicts of interest.

Figure 1 .
Figure 1.CID fragmentation pathways of N-glucosylated peptides containing Arg or Lys.

Figure 1 .
Figure 1.CID fragmentation pathways of N-glucosylated peptides containing Arg or Lys.

Figure 2 .
Figure 2. CID MS/MS spectrum of doubly charged C[+162.]KGTDVQAWIR.The general characteristics of the CID fragmentation patterns for the S-glucosylated peptides were presented in the Section 2.1.Figure 2 displays multiple neutral loss signals from the +2 charged precursor with dominating 162 NL.The relatively strong 179 NL peak (1.45 × 10 5 ) corresponds to 1-aminoglucose loss from N-glucosylated arginine.The less abundant 204 NL (2.76 × 10 3 ) indicates the combined loss of 1,2-anhydroglucose and carbodiimide.Diagnostic fragment ions from the CID spectrum of the +3 charged precursor are collected in Tables5 and 6and commented on in a table (Table7).

Figure 6 .
Figure 6.Hypothetical mechanisms of gas-phase glucosyl transfer from Cys residue to the guanidinium group of Arg residue.(a) Peptidyl-cysteine thiolate acts as a nucleophile; thioeter cyclopeptide formation, Glc+ capture; (b) N-glucosylated thioether cyclopeptide ring opening; (c) N-glucosylated charged peptidyl arginine (Arg+) formation; (d) Peptidyl-cysteine thiolate acts as a base; guanidinium group deprotonation, Glc+ capture.1. Route N: Cysteine thiolate acts as a catalytic nucleophile We postulate that the [Glc+/anh-Glc+][: S-Cys][Peptide] z+ OH ion pair can collide with a guanidinium group of the C-terminal arginine (ArgH+).Steric interactions and Columb attraction/repulsion forces affect the collisional transient structure and impact the SN glucosyl transfer process.The guanidinium function of arginine is a planar resonance-stabilized structure comprising a central carbon atom of sp2 hybridization, which is linked to three nitrogen atoms by single C-N bonds[41].The approaching thiolate anion neutralizes the positive charge of guanidinium carbon and changes the orbital hybridization of both carbon and nitrogen atoms.The resulting uncharged transient thioether cyclopeptide has tetrahedral carbon connected to three sp3-hybridized amine groups.This sterically crowded and electronrich structural element traps the proximate Glc+ ion to form a C1-N+ glycosidic linkage with an accessible primary amine group.The positive charge of the created transitional structure is located on a single nitrogen atom and is not resonance-stabilized.Finally, the reaction of a thioether ring opening liberates the side chain's cysteine thiol and restores the energetically favorable planar guanidinium N-glucosylated function.In summary, the proposed mechanism of glucosyl transfer appears feasible through the sp2 to sp3 hybridization changes in the carbon and nitrogen atoms of the arginine guanidinium function.Such a rehybridization can be described as "structural pyramidalization," which enhances the nucleophilicity of nitrogen atoms.The concept of enzymatic electrophilic functionalization of the arginine guanidinium group, omitting its deprotonation, was developed in studies on arginine kinases by Falcioni et al.[42].These authors proposed a sophisticated mechanism of Arg

Figure 6 .
Figure 6.Hypothetical mechanisms of gas-phase glucosyl transfer from Cys residue to the guanidinium group of Arg residue.(a) Peptidyl-cysteine thiolate acts as a nucleophile; thioeter cyclopeptide formation, Glc+ capture; (b) N-glucosylated thioether cyclopeptide ring opening; (c) N-glucosylated charged peptidyl arginine (Arg+) formation; (d) Peptidyl-cysteine thiolate acts as a base; guanidinium group deprotonation, Glc+ capture.1. Route N: Cysteine thiolate acts as a catalytic nucleophile We postulate that the [Glc+/anh-Glc+][: S-Cys][Peptide] z+ OH ion pair can collide with a guanidinium group of the C-terminal arginine (ArgH+).Steric interactions and Columb attraction/repulsion forces affect the collisional transient structure and impact the S→N glucosyl transfer process.The guanidinium function of arginine is a planar resonance-stabilized structure comprising a central carbon atom of sp2 hybridization, which is linked to three nitrogen atoms by single C-N bonds[41].The approaching thiolate anion neutralizes the positive charge of guanidinium carbon and changes the orbital hybridization of both carbon and nitrogen atoms.The resulting uncharged transient thioether cyclopeptide has tetrahedral carbon connected to three sp3-hybridized amine groups.This sterically crowded and electron-rich structural element traps the proximate Glc+ ion to form a C1-N+ glycosidic linkage with an accessible primary amine group.The positive charge of the created transitional structure is located on a single nitrogen atom and is not resonance-stabilized.Finally, the reaction of a thioether ring opening liberates the side chain's cysteine thiol and restores the energetically favorable planar guanidinium N-glucosylated function.In summary, the proposed mechanism of glucosyl transfer appears feasible through the sp2 to sp3 hybridization changes in the carbon and nitrogen atoms of the arginine guanidinium function.Such a rehybridization can be described as "structural pyramidalization," which enhances the nucleophilicity of nitrogen atoms.The concept of enzymatic electrophilic functionalization of the arginine guanidinium group, omitting its deprotonation, was developed in studies on arginine kinases by Falcioni et al.[42].These authors proposed a sophisticated mechanism of Arg phosphorylation, involving a process they described as "polarization-pyramidalization" of nitrogen in the arginine side chain.They also suggested that this phenomenon is exploited by many classes of enzymes mediating the post-translational modification of arginine, including N-glycosylation[42].The presented series of transformations proceeding in a gas phase and resulting in arginine residue N-glucosylation appears analogous to the arginine deiminase mode of action.Arginine deiminase (EC 3.5.3.6)uses the catalytic Cys406 as an essential nucleophile to form an intermediate covalent adduct with the guanidinium carbon of the substrate, followed by ammonia elimination[43].Other less adequate examples exist of liquid phase reactions employing protonated guanidine derivatives and thiolates to perform particular target-directed transformations[44].It has also been reported that the arginine guanidinium ion reacts with a carbanion derived from the condensation of glutaraldehyde with the lysine ε-amine group[45].

Funding:
This research was funded by the POLISH MINISTRY of SCIENCE and HIGHER EDUCA-TION, grant number 3195/B/P01/2007/33 (project number N N302 3195 33).Institutional Review Board Statement: Not applicable.Informed Consent Statement: Not applicable.Data Availability Statement: Data is contained within the article and Supplementary Materials.

Table 1 .
The set of diagnostic ions for investigating glycosyl transfer effects.

Table 2 .
The set of fragment ions from the +2 charged precursor, as evidence of fragmentations following paths A, B, and C.

Table 3 .
The set of doubly glycosylated fragment ions and nascent diagnostic ions from the +2 charged precursor, as evidence of fragmentations following paths A, B, and C.

Table 5 .
The set of fragment ions from the +3 charged precursor, as evidence of fragmentations following paths A, B, and C.

Table 6 .
The set of doubly glycosylated and nascent fragment ions from the +3 charged precursor, as evidence of fragmentations following paths A, B, and C.

Table 8 .
List of mass losses and gains for precursors and fragment ions.